Towards Transparent Systems: Semantic Characterization of Failure Modes

نویسندگان

  • Aayush Bansal
  • Ali Farhadi
  • Devi Parikh
چکیده

In this document, we give a more detailed explanation of the accuracy (ACC) vs. frequency-of-use (FOU) metric introduced in the main paper. We show the ACC vs. FOU curves for a human user1 as well as automatic failure prediction using our specification sheets. The area under these curves were reported in Tables 2 and 3 in the main paper (and are replicated here in Tables 1 and 2). Recall that one use case for our proposed specification sheets is the following: a user of a system can use the specification sheet to decide when to trust the system, and when not to trust it. If an input image falls in one of the scenarios listed in the specification sheet, the user would not trust the system. In other words, the user would not use the system for that input image, because according to the specification sheet, the system is like to fail on this input image anyway. Clearly, a specification sheet that lists many scenarios on it will capture many of the failure modes and perhaps (incorrectly capture) several scenarios that are not failures. As a result, the user will not be able to use the system very often (i.e. low frequency-of-use) which may be annoying or counter-productive for the user. But whenever he does use the system, it will likely be very accurate. This trade-off between the accuracy of the system (ACC) and frequency-of-use (FOU) is thus very relevant to the user – perhaps more so than the traditional precision-recall trade-off (results for the latter are shown in the paper, and may be more relevant for the use case where researchers are using our specification sheets to better understand their systems). If the input pre-trained classification system whose mistakes we are characterizing has a classification accuracy of 70%, a perfect specification sheet would ensure that all the mistakes (30% of the images) are detected by it, while the remaining 70% of the images are left un-flagged. Hence, a user using the specification sheet would ignore the vision system’s response (i.e. not use it) 30% of the time. Therefore, anywhere from 0 to 0.7 FOU, the vision system would have 100% accuracy. If the user chooses to work with a specification sheet that flags only 15% of the images (i.e. user uses the system 85% of the time), the accuracy of the vision system would be lower, somewhere between 70% and 100% at 0.85 FOU. If the user insists on ignoring the specification sheets and uses the system 100% of the time, the accuracy of the system will naturally be 70% (at FOU = 1.0). To capture this trade-off between accuracy of the system when used (ACC) and frequency-of-use (FOU), we plot ACC vs. FOU curves as shown in Figures 1 and 2.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A NEW APPROACH IN FAILURE MODES AND EFFECTS ANALYSIS BASED ON COMPROMISE SOLUTION BY CONSIDERING OBJECTIVE AND SUBJECTIVE WEIGHTS WITH INTERVAL-VALUED INTUITIONISTIC FUZZY SETS

Failure modes and effects analysis (FMEA) is a well-known risk analysis approach that has been conducted to distinguish, analyze and mitigate serious failure modes. It demonstrates the effectiveness and the ability of understanding and documenting in a clear manner; however, the FMEA has weak points and it has been criticized by some authors. For example, it does not consider relative importanc...

متن کامل

Reliability Analysis of K-Out-Of N: G Machining Systems with Mixed Spares and Multiple Modes of Failure (TECHNICAL NOTE)

This paper deals with the transient analysis of K-out-of-N: G system consisting of Noperatingmachines. To improve system reliability, Y cold standby and S warm standbys spares areprovided to replace the failed machines. The machines are assumed to fail in multiple modes. At leastK-out-of-N machines for smooth functioning of the system. Reliability and mean time to failure areestablished in term...

متن کامل

An Investigation into the Pull-out Failure Mechanisms of Suction Caissons

This paper reports results from an investigation into the suction caissons failure mechanisms under vertical pull-out loads. An insight to the failure mechanisms of suction caissons paves the path for developing analytical solutions to their pull-out capacity. The numerical models of suction caissons have first been calibrated by and verified against several experimental data from other researc...

متن کامل

Towards constructing an Integrative, Multi-Level Model for Cognition: The Function of Semantic Networks

Integrated approaches try to connect different constructs in different theories and reinterpret them using a common conceptual framework. In this research, using the concept of processing levels, an integrated, three-level model of the cognitive systems has been proposed and evaluated. Processing levels are divided into three categories of Feature-Oriented, Semantic and Conceptual Level based o...

متن کامل

Failure Mode and Analysis of the Bonded/bolted Joints between a Hybrid Fibre Reinforced Polymer and Aluminium Alloy

Composites are being used extensively in several engineering applications. However, the efficiency of the joints used in joining composites and metals can be improved. To move towards a sustainable and environment friendly future, natural fibre composite material was used. Towards the above objective, research work was carried out for the assembly between a composite and aluminium. Three differ...

متن کامل

Evaluation of Failure Causes in Employing Hospital Information Systems

Today, the information systems play a critical role in business for each organization. Like other organizations, hospitals use information systems for data collection, data storage, data processing and the like to have long-term and short-term achievements. Despite the very benefits of implementing HIS and its costly implementation, the HIS project sometimes fails. The importance of the HIS fai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014